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Objectives 
After working through this unit you should be able to: 
(i) explain in your own words why it is necessary to employ a limiting 
process in forming a derivative; 
(ii) define the following terms: 
average rate of change, 
instantaneous rate of change, 
derivative, = 
derived function, 
differentiable, 
D, D?, D®, etc., where D is the differentiation operator; 
(iii) differentiate (using the table of standard forms): 
sums of functions, 
products of functions, 
quotients of functions, 
inverse functions, 
exponential and logarithmic functions, 
trigonometric functions, 
composite functions involving any of the above; 
(iv) use the method of logarithmic differentiation to obtain derived 
functions; 
(v) determine the gradient of the tangent to a given curve at any given 
point on the curve; 
(vi) determine the velocity and acceleration of a body, given its position 
as a function of time. 


N.B. 

Before working through this correspondence text, make sure you have 
read the general introduction to the mathematics course in the Study 
Guide, as this explains the philosophy underlying the whole course. You 
should also be familiar with the section which explains how a text is 
constructed and the meanings attached to the stars and other symbols 
in the margin, as this will help you to find your way through the text. 
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Glossary Page 


Terms which are defined in this glossary are printed in CAPITALS. 


ACCELERATION ACCELERATION is the rate of change of VELOCITY: it 45 
is the DERIVATIVE of the function: 


time — velocity” 


AVERAGE RATE The AVERAGE RATE OF CHANGE of the image of a 4 
OF CHANGE function / over an interval [x,, x] in its domain is 
S82) = £04) 
= xX 
AVERAGE VELOCITY The AVERAGE VELOCITY over the time interval [1 , f] 1,3 


is 


distance travelled in [t,, 2) 
duration of {t),)  ~ 


CHAIN RULE, The COMPOSITE FUNCTION RULE (also called the 33 
COMPOSITE CHAIN RULE OF FUNCTION OF A FUNCTION RULE) states: 
FUNCTION RULE 

(egy =(f"oa) xg 


where f and g are DIFFERENTIABLE functions. 


DERIVATIVE The DERIVATIVE of a function / at 1 is i 
ye f(t + A) — flO) 
limit pA ess —. 
ho h 

DERIVED FUNCTION The DERIVED FUNCTION of a function f is 17 
fsx > DERIVATIVE of f at x. 


The domain of /” is the subset of the domain of / 
consisting of elements at which fis DIFFERENTIABLE, 


DIFFERENTIABLE A function is said to be DIFFERENTIABLE at those 17 
points of its domain at which the DERIVATIVE exists. 
If no points of the domain are mentioned, we mean 
that the function is differentiable at all points of its 


domain. 

DIFFERENTIATE, To DIFFERENTIATE a function is to find its DERIVED 17 
FUNCTION. 

DIFFERENTIATION The operator D: f-—> /” (fe set of real functions) 23 

OPERATOR is called the DIFFERENTIATION OPERATOR, 

FUNCTION OF A See COMPOSITE FUNCTION RULE. 

FUNCTION RULE 

INSTANTANEOUS, The INSTANTANEOUS RATE OF CHANGE of a function, i 


RATE OF CHANGE J. at the element x in its domain is /'(x). 


INSTANTANEOUS The INSTANTANEOUS VELOCITY of an object at time 9 
VELOCITY ris lim (average velocity over [r, 1 + h]) 

hoo 
INVERSE FUNCTION The INVERSE FUNCTION RULE states: if fis a one-one 41 


RULE function with inverse g, then 


vi 


LOGARITHMIC 
DIFFERENTIATION 


MAGNIFICATION 


PRODUCT RULE 


QUOTIENT RULE 


RATIONAL FUNCTION 


REAL FUNCTION 


SECOND DERIVED 
FUNCTION 


SIMPLE HARMONIC 
MOTION 


SLOPE OF TANGENT 


STANDARD FORMS 


TANGENT 


VELOCITY 


LOGARITHMIC DIFFERENTIATION is a method of 
DIFFERENTIATING a product of functions, /gh say, by 
writing the product in the form: 


Jgh = exp (Inf + Ing + In A) 
or 
In(fgh) = Inf + Ing + Inh, 


before differentiating. This gives: 


(fehy’ ght 
(Gh) fF ee 
The MAGNIFICATION (SCALE FACTOR) of an object is: 
ize of image 


ze of original” 
The PRODUCT RULE states: 


Cafe ay 
where f and g are DIFFERENTIABLE functions, The 
domain of (/g)' is the intersection of the domains of 
f and g'. 
The QUOTIENT RULE states: 


(4) -25# 
g r 


where f and g are DIFFERENTIABLE functions. The 


domain of (4) is the subset of the intersection of 


the domains of f’ and g’ consisting of all those 
elements, x, at Which g(x) # 0, 


A RATIONAL FUNCTION is a quotient of polynomial 
functions (see Unit /), 

A REAL FUNCTION is a function whose domain and 
codomain are subsets of R. 


The SECOND DERIVED FUNCTION of a function / is 
the DERIVED FUNCTION of the derived function of /; 
that is: /":.x-— DERIVATIVE of f’ at.x. The domain 
of f” is the subset of the domain of /’ consisting of 
elements at which /” is DIFFERENTIABLE. 


SIMPLE HARMONIC MOTION is an oscillatory motion 
in which the ACCELERATION al time / is proportional 
to the displacement at time ¢. The displacement, x, 
is of the form: 

x= 


o t+ asin b(t — to), 


where a and b are constants and fy, xy are the initial 
time and the initial displacement respectively. 


The SLOPE OF THE TANGENT (if it exists) at x to a 
curve specified by the equation y = f(x) is f(x). 
STANDARD FORMS are expressions for DERIVED 


FUNCTIONS of polynomial, rational, trigonometric, 
exponential and logarithmic functions. 


The TANGENT (if it exists) at x to the curve specified 
by the equation y = f(x) is the straight line which 
touches the curve at (x, /(x)). 

See AVERAGE VELOCITY and INSTANTANEOUS 
VELOCITY. 
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Notation 


The symbols are presented in the order in which they appear in the text. 


[x1,¥2] 
Ay 
w(ty, t2) 


viii 


The interval {x:xeR,x, <X < Xp, with x, < xp}. 
The finite difference operator. 

The average velocity over the time interval [t,t]. 
The set of positive real numbers. 

The limit of the function f near the point 0. 


The set of negative real numbers. 
The set of positive real numbers together with zero. 
The derivative of f at t. 


Alternative notation for f'(x), where {:x— y. 


The derived function of f. 

The derived function of x-—* x*. 

The set of all real functions. 

The differentiation operator. 

“and”. 

The composite of the functions / and g. 
The image of F under the operator D. 

The image of x under fog. 

The function r?:x+——* r(x) x r(x) (xe R), 


where r:xt—?r(x) (xe R). 
The function mo (xeER A v(x) # 0), 
where ts x-—40(x) (ve R). 


The exponential function. 
The natural logarithm function, 
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12.0 INTRODUCTION 


In this unit we shall be concerned with change, and, in particular, the 
rate at which things change, 


Everything in the physical world about us is changing, sometimes rapidly, 
sometimes very slowly. We know, for example, that plants are growing; 
so their sizes are changing, albeit very slowly, We know too that an insect’s 
wings move as it flies, even though they usually beat so rapidly that it is 
impossible for us to follow the movement with our eyes, An intermediate 
example is the movement of a motor car. Here it is the position of the car 
that is changing, and we have an instrument in the car which measures 
the rate of change of position, namely the speedometer, 


The general aim of this unit is to set up a mathematical scheme for 
describing and measuring rates of change. Because everybody has some 
familiarity with velocity and acceleration, we have chosen motion as our 
starting point. The ideas we shall develop are, however, of great generality. 
They are used not only in kinematics (the study of velocities and accelera- 
tions) but also in studying many other types of rate of change, such as the 
rate of population growth; the rate at which the boiling point of water 
changes with height on a mountain; the rate at which the temperature in a 
room decreases with distance from a radiator, and so on. In the first 
section we are maifly concerned with the mathematical description of these 
rates of change, that is, with the concept of the derivative of a function. In 
the second and third sections we establish some rules for finding 
derivatives. 


As we remarked in Unit 9, our notation for the calculus is different from 
the classical notation which is used in most textbooks; our notation is 
consistent with our basic approach to mathematics through the concept 
of a function. Once you have tered the basic principles, there is no 
objection to your using the traditional Leibniz notation which is discussed 
in Appendix I on page 56. 
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12.0 
Introduction 


Leibniz 
(1646-1716) 


(Mansell) 


12.1 RATES OF CHANGE 
12.1.1 Average Rates of Change 


We have already discussed the concept of average velocity in Unit 7, 
Sequences and Limits I. The purpose of this sub-section is simply to remind 
you how average velocity is defined, and to consider some of the conse- 
quences of the definition. 

To illustrate the idea of average velocity, suppose you made a car journey 
from London to Edinburgh and recorded the total distance covered at 
hourly intervals. The recorded data might be given in the following table: 


Time (h) Distance (km) 


een dsHsyun-o 
Ss 


The table shows that at time 0 the distance travelled is 0, where time 0 is 
the time at which the journey begins. After one hour the car has covered 
15 km, After two hours it has covered 110 km, ic. in the second hour it has 
covered 95 km. In the third hour it has covered 90 km, but in the fourth 
and fifth hours it has only covered 40 and 60 km respectively—perhaps 
you stopped for lunch, Then the car covered 90 km in each of the sixth, 
seventh and eighth hours of the journey. In the ninth hour it only covered 
30 km; perhaps the traffic was heavy near Edinburgh, 

In discussing your trip to Edinburgh with a friend who has made a similar 
long journey you might wish to compare your average velocity with his. 
To calculate it you would divide the distance travelled by the time :** 


distance travelled 


average velocity = tine 


_ 600 km 
pe 
= 66.7 km/h. 


Alternatively, you might wish to compare your average velocities over 
different parts of the journey; for example: 


P distance travelled 
average velocity over first 3 hours = — Mike 


200 km 


= 66.7 km/h 


* Velocity means speed in a known direction. Here we assume that the road is straight so 
that only two directions of motion are possible: away from London and towards it. We 
distinguish them by giving the velocity a positive sign for motion away from London, and 
a negative sign for motion towards it 
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Table I 


600 km — 390km 


i last 3 = 
average velocity over last 3 hours ean 


= 70km/h. 


In each case the method of calculation is an application of the formula 
given in the Introduction to Unit 7, Sequences and Limits I: average 
velocity over time interval [t, , f2] is 


distance travelled in [t,,t2] _ x2 — 1 
duration of [t,, t2] 7 hh 


where x, and x, are the distances from London at the times f, and ¢, 
respectively. The essential restriction on 1, and t, (apart from the obvious 
one that they must be among the instants for which the distances of the 
car from London are given) is that they must be different ; that is, t, > ¢,, 
for if t; = t,, then the fraction in Equation (1) has denominator zero, 
and fractions with zero denominator have no meaning. We shall return 
to this important point in section 12.1.2. 


, distance in km 


Edinburgh 600 


2004 


100 
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Equation (1) 


De seer eeietemenes 


2 
o 
o 
~ 


time in hours: 
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Average velocities have a very convenient representation in terms of 
pictorial graphs. The figure below shows how we can use the graph to 
calculate the average velocity over the time interval [6,9]. In the right- 
angled triangle ABC, the side AB corresponds to a time, 3 hours, and BC 
corresponds to a distance, (600 — 390)km = 210 km. The magnitude of 
the average velocity over (6, 9] is 24° which is the slope of AC (i.e. tan CAB), 


distance in km 
+ 


Similarly, the magnitude of the average velocity over any time interval, 
[t,, 2], as given by Equation (1), is the slope of the straight line joining 
the two points on the pictorial graph corresponding to t, and f,: the unit 
of the average velocity is km/h. 


distance 


The concept which underlies average velocity can be generalized to many 
other situations, if we first pick out its essential feature with the help of 
the concept of a function, In the example of the car journey, the relevant 
function is the one tabulated in Table I; 


J:Atime since start) — (distance travelled). 
In terms of this function, the formula for average velocity is: 


S(tz) = fy) 


fy ty 


average velocity over the interval [r,,f.] = Equation (2) 
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Expressions of the type used in Equation (2) can be useful for many 
other situations involving rates of change. For example, /(r) may represent 
the quantity of water in a reservoir at time t. In this case, the expression 
on the right of Equation (2) has the interpretation 


change in quantity of water in 
f(t) = flty) _ reservoir during [t,t] 
She. duration of [¢,, 2] 


That is, it gives the average rate at which the quantity of water in the 
reservoir changes during [t,.f2]. As another example, f might be the 
function: 

f:(depth below sea level)— (hydrostatic pressure) 


(at some particular time and place). In this case. ifd, and d, are two depths, 


then 


J(d3) — fd) _ corresponding change in pressure 
dj—d, change in depth 


= average rate of change of pressure with depth. 


The value of this fraction is roughly 0.1 atmosphere per metre, An 
aquanaut can use this value to deduce his depth changes from measured 
changes in pressure. 

In general, if f is any real function (i.e. a function whose domain and 
codomain are subsets of R), we define: 


Average rate of change of f(r) over the interval [t, , fz] is Definition 4 
S(t2) = ft) 


bh 


Exercise 1 Exercise 1 
(3 minutes) 


Add a third column to Table | (reproduced below), showing the first 
differences of the tabulated values of the function /, with domain [0,9], 
defined by: 


f:(time since start)— (distance travelled). 


(See Unit 4, Finite Differences for the definition of first differences, and 
also for the notation A,,) 


Time (h) Distance (km) First Difference (km) 


CSwPADAWEwWNHe So 
s 


(i) What is the interpretation of the numbers in the third column in 
terms of average velocity? 
{ii) How is the function A, f connected with average velocity? 


(iii) If we define ida sto be the function: 


rise +h) — f()} (te [0,9 — h)), where 0 <h <9, 


how is this function connected with average velocity? | 


12.1.2. The Need for Instantaneous Velocities 


The average velocities obtainable from the data given in the last section 
give only a very crude indication of the speeds at which the car travels 
during the journey. For example, the car took one hour to cover the first 
15 kilometres. Was this a relatively steady 15 km/h or did the car travel 
at 30 km/h for roughly half the time and spend most of the other half 
standing in traffic jams? 

There is no way of answering this question from the data given in Table 1 
of the preceding section. To answer it we would need more detailed 
information, such as a table showing the total distance after each minute 
of the journey, instead of each hour, Here is part ofa hypothetical table of 
this kind for the London-Edinburgh journey : 


Time (min) Distance (km) 


Auseun-o 
Ss 
a 


60 15.0 


It looks as though there was a traffic jam 0.6 km from the starting point. 
From such a table we can calculate average velocities just as before; for 
example, the average velocity during the third minute was 0.25 km/min, 
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12.1.2 


(continued on page 6) 


Solution 12.1.1.1 


Time (h) Distance (km) First Difference (km) 


0 


0 

1 1s a 
2 110 90 
3 200 a 
4 240 oe 
5 300 ie 
6 390 
7 480 oe 
8 570 a 
9 600 


(i) The numbers in the third column are the average velocities (in km/h) 
during the hour in question: 15 km/h during the first hour, 95 km/h 
during the second hour, etc. (This is because the intervals are each of 
length one hour. If they were each of length two hours, for instance, 
we would have to divide the first differences by two to get average 
velocities.) 

(ii) A, f(t) is the average velocity during the time interval [t,t + 1). 
Again, it is the subscript 1 that allows us to identify A, f(t) with average 
velocity. If we change the interval h, then the average velocity becomes 
1 
h 
(0, 8) only.) 


A, f(t), not just A,,f(t). (Notice that in this case A, is defined for 


(iii) TA, f is the average velocity during the time interval [t,¢ + h]. 


(continued from page 5) 


that is, 15 km/h. But once again, such average velocities give only a rough 
idea of the speed of the car as measured by the speedometer. Was the speed 
during this third minute a steady 15 km/h, or did the driver accelerate to a 
much higher speed during the first part and then brake sharply? The 
table above enables us to locate the traffic jams (assuming that they cause 
the stationary periods) but it gives very little information about how the 
driver used his brakes, for instance, 

To give information about driving habits, the distance function would 
need to be tabulated with an even smaller time spacing; probably a table 
showing the distance travelled after every second of the journey would be 
adequate to provide this type of information. Here is a small part of such a 
hypothetical table: 


Time (s) Distance (m) 
150 570 
151 585 
152 595 
153 600 


The average velocity during the one-second interval [150, 151] is 15 m/s 
(54 km/h); during the next interval it is only 10 m/s, and in the following 
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Solution 12.1.1.1 


interval it is 5m/s, indicating that the driver braked sharply (or had a 
collision). 

In Unit 7, Sequences and Limits I, we pointed out that if the occupants of 
the car had been unfortunately involved in a collision, then even the average 
velocity over the last second before impact would not be of direct interest 
to them, If the driver were braking sharply during the last second, the 
actual impact would occur at a speed of several kilometres per hour less 
than the average speed during the last second, and these few kilometres per 
hour could make a vital difference to the outcome. Clearly then, there are 
some purposes for which even a one-second time interval is too long for 
the computation of a meaningful average velocity over that interval. 


It appears from this discussion that to answer any particular question 
involving velocities (Was the car involved in an accident? Was the driver 
heavy on his brakes? Did the driver accelerate rapidly?) we can, in theory, 
always find a time interval short enough for average velocities over that 
interval to be useful in discussing the question. It would obviously be 
desirable to have a definition of velocity that could be guaranteed in 
advance to satisfy all the demands that might be made on it, instead of 
possibly having to be adjusted after some new demand became known. 
That is, we would like to contract to zero the duration of the time interval 
used in defining velocity, making it not the average velocity over an interval, 
but an instantaneous velocity. 

A first attempt at such a definition might be to use a time interval of zero 
duration in the formula for average velocity, Things are not that easy, 
however, The average velocity over the time interval [¢, ,t)], which we shall 
in future denote by w(t, f,), was defined in Equation 12.1.1.2 by: 


_ fits) = flty) 
W(t). 3) = c= 
where f is the function: 
J:(time since starting) — (distance travelled), 


If we try to use a time interval of zero duration, then we have 1; = t) and 
S(t) = f (tz), so that the fraction on the right-hand side of the equation 


4 eee a ee ; z ear 
turns into the expression 0 which is meaningless, since division by zero 
is not defined. 


Exercise | 


What is the fallacy in the following argument? 


Let 
land y = 1, 
so that 
x 
Then 
Le. 
(x — y)(x + y) = (x — yy. 
So 
(x + »)=y (after cancelling by the factor (x — y)) 

Le. 


1 (since x = | and y = 1), a 


Exercise 1 
(2 minutes) 


Solution 1 


Cancelling a factor in an equation is the same as dividing throughout by 
the factor, and is not permissible when that factor is equal to zero. For 
example, 3 x 0 = 2 x 0, but 3 4 2. Thus, the correct deduction to make 
from the equation 


(x — p(x + ¥) = (x — yy 


xty=yifx—y #0: 
but in this case 


0, 


so that the deduction x + y = y cannot be made. a 


12.1.3. The Definition of Instantaneous Velocity 


In the previous section, using average velocities over shorter and shorter 
time intervals, we obtained expressions for average velocities which be- 
came increasingly s tory as the time intervals decreased, in the sense 
that they gave us more and more information. We could regard these 
successive average velocities over smaller and smaller intervals as succes- 
sive approximations to the velocity at a particular instant. Successive 
approximation procedures were considered in Unit 7, Sequences and 
Limits I, and we saw there how to define a number called the limit of a 
real function. In the present c the approximation depends on the 
duration of the time interval [¢, , 2]. If we denote the duration t, — t, by hy 
then tr; = 1, + hand we can define the function g by: 


f(t +h) f(y) 


ght) +h) = 
h 


(he subset of R*) 
We are interested in the limit of g near 0. which we shall define to be the 
instantancous velocity at t,. Similarly, the instantaneous velocity att 
can be defined by the limit near 0 of the function: 

fitz) — (tz — 


ght wits = ht) = = Mi esubset of R*). 


rien 5 aS “s 
To see how this limit can be used to avoid the piificulty, try the following 


exercise. 
Exercise 1 


Galileo is (apocryphally) said to have tested the law of gravity by dropping 
objects of differing weights from the Leaning Tower of Pisa; we shall 
consider a similar experiment. Assuming that after the object has fallen for 
t seconds it has fallen exactly 5t? metres, and that it takes exactly 3 seconds 
to reach the ground, find the average velocity 

(i) over the last second of fall ; 

(ii) over the last tenth of a second: 
(iii) over the last hundredth of a second. 


What is the instantaneous velocity at the moment of striking the ground? 
Can you find a formula for this? a 


Solution 1 


Equation (1) 


Equation (2) 


Exercise I 
(3 minutes) 


We can rewrite the definition of average velocity in Equation (2) in the 
following way: 


a S (tz = h) = f(tr) 


=} (he subset of R*), 


& 


If we now put h = —k, this becomes; 


gh ee) (k € subset of R~, the set of all 


negative real numbers), 


and we see that this has the same form as Equation (1), except that the 
domain consists of negative numbers instead of positive numbers. So if 
we consider the function: 


giht—+wi(t,t +h) (he subset of Rh # 0), 


then its images give the average velocity over a time interval of length /, 
beginning or ending at time t, depending on whether h is positive or 
negative. Although 0 does not belong to the domain of this function, the 
function can have a limit near 0. 


w(t,t+h) 


In Unit 7, Sequences and Limits 1, we defined the limit to be the number, 
L, (if it exists) such that for each positive number ¢, however small, there is 
4 positive number 6 such that the image of the set {h:—d <h <6 and 
h # 0} is a subset of [L ~ e, L + ¢]. The value of this limit is called the 
instantaneous velocity at time ¢. If we write v(t) for the instantaneous 
velocity at time 1, this definition takes the form 


f(t + h) = fit) 


AG a 


1 
a nae [iawn] 


Exercise 2 


Find the instantaneous velocity at time ¢ of a car which moves in such a 
way that its distance from some fixed point on the road it travels is at® 
where a is some positive number. a 
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Main Text 


Definition 


Exercise 2 
(3 minutes) 
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Solution 1 Solution 1 
The average velocity over the last h seconds of fall is, for h > 0, 


distance travelled in [3 — h, 3} 
duration of [3 — h, 3] 


_ 5x3? —5x3—h? 


w(3 — h, 3) = 


h 
_ 45-5 x (9 — 6h +h?) 
= h 
_ 30h — Sh? 
Tow 
= 30 — Sh. 


(This last line can only be obtained from the previous one if we assume 
h # 0, since we cannot divide by zero.) So the function we are interested in 
7B gih—+30- Sh (he R10< h <3). 
The average velocity over the last second (h = 1) is 
w(2, 3) = 30 — 5 = 25 m/s. 
The average velocity over the last yy second (h = 0.1) is 
w(2.9, 3) = 30 — 0.5 = 29.5 m/s. 
The average velocity over the last ¢by second (h = 0.01) is 
w(2.99, 3) = 30 — 0.05 = 29.95 m/s. 


Taking the limit of g near 0 gives the instantancous velocity at 3 seconds 
as 


lim* (30 — 5h) = 30 m/s. 
hw" 


Notice that this is not the same as g(0), because g(0) does not exist, Of 
necessity the domain of g does not include 0; the simple expression 


A 30h — Sh? r 
30 — Sh arises from aula and the latter expression becomes 
meaningless if we set h = 0. a 
Solution 2 Solution 2 


The position of the car at time f is given by the function 
f:t-—at* (te Rb, the set of all non-negative real numbers). 


For any ¢ in Rj, the average velocity of the car over a time-interval 
(t.¢ +h), h > 0, is given by: 


a(t + hy’ — at? 
h 
_ a(t? + 37h + 3th? + h*) — at? 
- h 
= 3at? + Bath + ah*) (he R,h > 0) 


w(t,t + h) = 


The velocity at time ¢ is then given by 
u(t) = lim w(t, t + h). 
nO 
It should be intuitively clear that v(t) = 3at? (re RG ), since the terms 3ath 
and ah? tend to zero as h tends to zero. a 


* We use the notation lim, rather than lim, because here we have a “one-sided limit”; 


02 * ret) 
that is, h is essentially positive and so / can only tend to 0 via positive values, 


12.1.4 The Derivative 


Just as the idea of average velocity can be generalized to give a definition 
of average rate of change for any function, so the idea of an instantaneous 
velocity can also be generalized to give a definition of rate of change of a 
real function having no direct connection with kinematics. As in the 
discussion of average rates of change, all that we have to do is to apply the 
same formula as in the definition of instantaneous velocity, and call the 
analogue of instantaneous velocity the (instantaneous) rate of change, That 
is to say, if f is any real function, we can define the (instantaneous) rate of 
change of f(x) at the element x in the domain of f, to be the number 

tim SL) 

mo oh 
provided that this limit exists. This rate of change at x is usually called the 
derivative of f at x and it is denoted by f(x), Thus the velocity of a car t 
seconds after the start of its journey is equal to the derivative at f of the 
function (numbers of seconds since starting)" (distance since starting). 


There are other notations for the derivative; in the most important of 


we 


them, the Leibniz notation, we write —— in place of f"(x). This notation 


is discussed in Appendix I, but bathe with it is not essential to the 
Foundation Course. 


Like the average rate of change, the instantaneous rate of change, or 
derivative, has a very useful interpretation in terms of a pictorial 
graph of the function, We have already seen that the average rate of 
change of f(t) over an interval [r,, 1, + h] equals the slope of the straight 
line joining the points on the graph corresponding to t, and t, + h. 


yf 
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Main Text 


Definition | 


Notation | 


To obtain the derivative at 1,, we let the magnitude of h get smaller and 
smaller. In the above figures the point denoted by @ then slides (like a 
bead) along the curve towards the point P which remains fixed. Corre- 
sponding to the first diagram, three successive positions for Q are shown 
below, labelled Q,, Q,, 03. (We have spaced the points generously round 
the curve, so that the figure is clear.) The corresponding positions for the 


f(t) 


straight line PQ are also shown, As Q approaches P, the line rotates about 
P. and approaches a limiting position which we define to be the tangent to 
the curve at P. It follows that when h is very small, the slope of the line PQ 
is very close to the slope of the tangent at P, and consequently that the 
limit of the slope of the line PQ near P is equal to the slope of the tangent 
at P. 


4 ft) 


In symbols, using Definition |, we have: (slope of tangent to graph at 
P) = (derivative of function at f,) 


The geometrical interpretation of the derivative is very useful, not only in 
geometry. We shall use it frequently when discussing applications of the 
derivative; for example, when obtaining approximations to real functions 
(Unit 14, Sequences and Limits 11) and when finding the maximum and 
minimum values of the images of real functions (Unit 15, Differentiation 
TI). 

There are functions whose graphs do not have tangents at every point. For 
example, the graph of the function 


f:xt—> |x| (xe R) 


does not have a tangent at (0,0). 
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Definition 2 
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F(x) = bl 


You can see this by considering the slope of the line OQ as Q approaches the 
origin, O, where Q isa point on the graph. Consider first the case in which @ 
isa point on the graph of the function x:—+ — x, and then that when Q is 
4 point on the graph of the function x-— x. The slopes of the lines are 
respectively —1 and +1; the tangent at (0, 0) does not exist. We have 


A,J(x)_ +1ifx>0 im SHO) 
nh 


lim 


= “—— does not exist. 
ro oh —lifx<0 ho 


We say that the derivative of f at 0 does not exist. 


The illustration below shows the graph of another function which has a 
tangent at each point, except at (1,, /(t,)). Again, the derivative of the 
function at r, does not exist, 


f(t) 


graph of f 


Exercise 1 Exercise 1 
(4 minutes) 


Which of the following statements are true? 
(i) The tangent toa curve at P cannot cross the curve 
at P. TRUE/FALSE 
(ii) The angle between the tangent toa curve at P and 
the horizontal axis is the limit, as Q approaches P 
along the curve, of the angle between the line PQ 


and this axis. TRUE/FALSE 
(iii) The tangent to a curve at P may be defined as the 
straight line that meets the curve only at P. TRUE/FALSE 
(iv) If f is continuous at every element in its domain 
there is a tangent at every point on its graph. TRUE/FALSE 
(See Unit 7, Sequences and Limits I for the defini- 
tion of continuity.) a 
Exercise 2 Exercise 2 


. 7 7 (3 minutes) 
Show that the derivative of a constant function at any element in its 


domain is 0. a 
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Solution 1 Solution 1 


(i) FALSE. Consider, for instance, our example of a car whose position 
is described by 


Sit—+rat® (teR3), 
We found that 

f(t) = 3a? (reRg). 
Similarly, for the function 

git— at? (te R) 
we can show that 

g(t) = 3ar? (re R). 


The derivative of g at () is therefore zero, so the tangent at this point is 
the 1-axis, which crosses the curve specified by y = g(x) at (0,0), 


A) 


(ii) TRUE, whenever the tangent at P exists. 
(iii) FALSE, Consider the following diagram: 


(x) 


Line B meets the curve A only at Q, but the tangent to the curve at Q 
does not exist. Line C meets the curve A at P, Rand S and isa tangent 
to the curve at S. Line D meets the curve A at T only, but it is not the 
tangent to the curve at T. 

(iv) FALSE. Consider the modulus function again: 


Six |x| (xe R). 
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f(x) =1x1 


This function is continuous at every element in its domain, and in 
particular at the origin, O. But 


A,f0) _ fO +h —f0)_f{ Lifh>o 
h h ~ (-Lifh<0 
so there is no number L for which 


im) oy . 
nmo oh 
Solution 2 Solution 2 


Let f:x'—— > a(x € R) where a is any real number. Then, for all x € R and 
all non-zero he R, we have: 


A(x) _ f(x + h) = fx) 


h h 
_Ge~ 8 
| 
=0. 
Thus, 
I(x) = limo 
no 
= 0, a 
Exercise 3 Exercise 3 
(3 minutes) 
If f is defined by 
St) = St? (re R), 
find f’(t). (You have solved a similar problem in Exercise 12.1.3.2) a 
Exercise 4 Exercise 2 
(2 minutes) 


Sketch the graph of 
fit— (te RR). 

Find f(t) and evaluate 
S(—3), FO) and f(2). 


What can you deduce about the slopes of the tangents to the graph of 
fat —3,0 and 2? 


Sketch these tangents on your graph. a 


FM 12.1.4 


Solution 3 Solution 3 
For all re R, and all non-zero he R, we have: 


As) _ flt +h) — fe) 
an h 
_ S(t +h)? — St? 
h 


= 10t + 5h 
Thus 
f(t) = lim (10¢ + Sh) 
hoo 
= 100. a 
Solution 4 Solution 4 


Again, for all te R and all non-zero he R, we have: 
ASW _ fi +h) = fo 


h h 
(t+ hp -e 
h 
=2+h. 
Thus, 
f'(t) = lim (2t + h) 
ho 
= 2, 
In particular, 
f(-3) = -6 
S(0) = 0, 
f'(2) = 4; 


so the slope of the tangent of the graph is negative at —3, zero at 0 and 
positive at 2, 


Your graph should look something like this: 
4 f(t) 


(-3,(-3)) 
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12.2 THE RULES OF DIFFERENTIATION 12.2 
12.2.0 Introduction 12.2.0 
We have already indicated the wide range of applications of the concept Introduction 


of derivative which we defined in the preceding section. To make full use of 
this concept we need to look at some of the properties of the derivative. 
The first fruits of this study will be a set of rules which will enable us to 
calculate derivatives of functions quickly and easily, without having to 
go back to the definition every time. 


12.2.1 The Derived Function 1221 
We start with the definition of the derivative of a real function, f, at x: Main Text 
(3 x+h X 
J'(x) = lim An fix) = lim fists St) 
noo oh hv0 h 


The derivative is a number; however, since this number depends on the 
value of x, we can use the notion of derivative to define a function. This 
function maps x to the value of the derivative of f at x. In fact, by our 
notation f“(x) we have already implicitly recognized the existence of this 


function. This new function /” is called the derived function of f, and in Definition 1 
finding it we are said to differentiate the function f- Definition 2 


The domain of the derived function will be taken to comprise all the values 

of x for which f(x) exists, Since Definition | involves j, the domain of the 

derived function must be a subset of the domain of /. In some cases the 

two domains may be the same; in others, the domain of /’ may be a proper 

subset of that of f; that is, it may exclude certain numbers that are in the 

domain of /, because the limit defining the derivative does not exist at these 

numbers. We say that a function / is differentiable at those elements in its Definition 3 
domain where f(x) exists. Thus the domain of /” is that subset of the oe 
domain of f comprising the numbers at which / is differentiable. 


Example | Example t 
The derived function /’ of the function 

fet (xe R) 
is given by: 


1 


hy - I 
I(x) = reesei x li ! = 
“oO nol 


and the limit exists for all x € R; so the domain of /” is also R. 


1x), 1100 


domain of fand f’ 
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Example 2 Example 2 
The derived function g’ of the modulus function 
gixt—>|x| (xe R) 
is given (where it exists) by: 
ie 


hoo 


g(x) 


The evaluation of this limit, though not difficult, is a little tedious (see text 
following this example), A simpler method of finding the domain of g’ 
is to use the graph of g. The graph below shows the point (0,0) at which 
there is no tangent; since the slope of the tangent coincides with the value 
of the derivative, we see that there is no derivative, ic, the limit does not 
exist near x = 0, At all other points there is a tangent (in fact it coincides 
with part of the graph) and so the function is differentiable everywhere 
except at x = 0. Thus the domain of g’ consists of the set R with 0 omitted, 


9(),9'(x) 


Like all arguments based on graphs (or figures in general), the one given in Discussion 
Example 2 makes use of geometrical intuition and it is therefore a demon- 
stration (in the sense defined in section 7.2.3 of Unit 7, Sequences and 
Limits I) rather than a proof, For a proof we must go back to the definition 
and evaluate the limit, We consider three cases: 


(i) x>0 
In evaluating the limit it is only small values of |h| that matter; so 
we need only consider h > —x, so that h + x > 0. It follows that 
|x| = x and |x + h| = x + /; the definition of /’(x) therefore gives 


(X+h)—x_h 


h feats 


f(x) = lim 
hoo 
(ii) x <0 


Again, we need only consider small values of ||, We take h < —x, so 
that h + x <0, It follows that |x) = —x and |x + h| = —(x + h); 
so the expression for /’(x) is 


f= lim + hl — |x| he —(x+h)+x . —h ‘a ail 
bo h no h h 
(iii) x =0 
The expression for {“(x) is 
h 
S'(x) = lim Ua 
woh 


hi 
but since takes the value +1 for small positive h and —1 for small 


: h i 
negative /, there is no number close to a for all small h: that is, the 


function x-— |x| is not differentiable at 0. 


12.2.2 Differentiation of Polynomials 


You have already seen how to differentiate a few functions. The results are 
shown in the table: in each case f has domain R. 


Tt se Page Number 
x+—> constant x—0 (xeR) 15 
xx xe—til (xe R) 17 
Kind xh— 2x (xe R) 16 
x x—+3x? (xe R) 14 


x-— |x| 


x1 (xeR*) 
x= 1 (xe R>) 


To work out each new derivative, when it is required, from the definition 


L(x +h) = fix) lim Sef) 
ok . yee 


oes tia moh 


0 
would be very laborious. It is much easier to work out in advance a 
system of rules which makes it possible to differentiate many functions 
from a knowledge of relatively few basic derivatives. To formulate these 
rules, we begin with a class of particularly simple functions, namely the 
polynomial functions. The definition of a real polynomial function was 
given in Unit 4, Finite Differences: it is any function of the form: 


PX gX" + Aq yx"! tees tb ayx+aq (xe R) 
where do, ,,.+.,d,€ R. 


We saw in Unit 4 that the difference operator A, applied to a polynomial 
function of degree n(n € Z*) gives another polynomial function of degree 
n—1. In view of the presence of A, in the definition of f(x), it is 
not surprising that just the same thing happens when the polynomial 
is differentiated instead of differenced. 


Exercise 1 


Differentiate the function: 


f:x-— ax? + bx +e (xER). a 


Rather than try to differentiate the general polynomial function of degree n 
immediately, let us begin with the simplest such polynomial function 
which is 

xH— x" (xe R) 


where n is any positive integer or zero. The cases n = 0, 1, 2, 3, have already 
been treated in various exercises and examples; so if you know the 
binomial theorem you should not find the general case difficult. 
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2.2.2 


Exercise 1 
(3 minutes) 


Main Text 


(See RB9) (continued on page 20) 


Solution 1 

We must differentiate the function 
fix-—ax? + bx +e (xR). 

That is, we must find 


fs) = lim L(x + h) = f(x) 
h=0 h 
a(x +h)? + b(x +h) + ¢ — (ax? + bx + ©) 
= ae h 


. 2axh + ah? + bh 
im 
nO h 


i 


lim (2ax + ah + b) 
a0 


2ax + b, 


The derived function is, therefore: 
[':x'—> ax +b (xe R), 
Notice that if we write 


fix ax?, fyix'-—> bx, fyixr—ae (xe R), 
then 


ith +Ss 
and 


Paty t+ fatty | 


(continued from page 19) 


2 


Exercisi 
Differentiate the function: 
Six-—>x" (xe R) 


where n is any positive integer or zero, a 


Extending the f’ notation for derived functions, we can write the result 
of Exercise 2: 


(x-— x")! = (x-— nx") (WER). 


Some of the results obtained earlier (for example, the derivative of a 
constant function is zero, the derivative of t-—+r° is t-— 31°) are 
special cases of this important result. The general polynomial 


P(X) = yx" + dy yx"~! + ++ + ax + do 


can be built up from “elementary” polynomials of the form x" by first 
multiplying each of these elementary polynomials by the appropriate 
coefficient a, to get one of the terms a,x* in the general polynomial, and 
then adding all these terms together. By the corresponding operations on 
the “elementary” polynomial functions x-—+x* we can build up the 
general polynomial function: 


Pix agx" + ay x" | toe Fax +dy (XER). 
It follows that we shall also be able to build up the derived function p’ 


provided we can obtain rules enabling us to deduce: 


(i) the derived function of x a,x" from that of x-— x*, 
(ii) the derived function of a sum of functions from the derived functions 
of its individual terms. 


Exercise 2 
(2 minutes) 


Mala Tent 


(Notice that this procedure is exactly the same as that adopted when 
differencing polynomials in Unit 4, Finite Differences, when calculating 
limits in Unit 7, Sequences and Limits 1, when calculating definite integrals 
in Unit 9, Integration I and when discussing switching circuits and other 
systems in Unit //, Logic I. In each case we deal with “complicated” 
expressions by breaking them up into “simpler” ones, dealing with these 
and then finding rules, usually in the form of morphisms, for putting the 
results together to obtain the result for the “complicated” expression.) 
Perhaps you would like to guess these rules for yourself. 


Exercise 3 


Guess the correct expressions to put in the boxes: 
(i) For any non-zero number a and any function f, the derived function of 


t-—+af(t) is where t € (domain of f"). 

(ii) For any two functions f and g with the same domain, the derived 
function of t-—+ f(t) + g(t) is where f€ (intersection of the 
domains of f° and g’), Is the function r-—+/(t) + g(t) necessarily 
differentiable in this domain? a 

Exercise 4 

Using the definition of a derivative, prove or disprove the guesses you 

made in the preceding exercise. a 

Exercise 5 

Express the derivative of f + g in terms of /’ and g’. a 


We can state the two rules of differentiation given in Exercise 3 as follows: 


First Rule of Differentiation 
(af) = af’, ie. multiplying any function by a number multiplies 
its derived function by the same number. 

Second Rule of Differentiation 


(f + 8) = JS’ + g', ie. the derived function of a sum of functions 
is the sum of the individual derived functions, provided the 
domains are appropriate. 


The second rule can be extended to cover any (finite) number of terms in 
a sum. We can now construct a derivative for the general polynomial 


DiX*— a,x" + ay yx"! + + aX +aq (KER). 
We have already shown that 
(xt—> xt)! = xr kxt! (xe R). 


By Rule 1, multiplying a function by a constant (in this case a,), multiplies 
its derived function by the same constant, so 


(x-—> ax") = x-—r kay"! (xe R). 
By Rule 2, the derived function of a sum of two or more functions is the 
sum of their individual derived functions. The polynomial function pis the 
sum of the functions (x-— a,x"), (xa, ,x"~") etc.; therefore its 
derivative is the sum of their derivatives: 
p! = (x-—+na,x"~") + (x(n — Ia, x"7?) + 

+ (x-—a,) 

that is, 

pi = x+—>(na, x"! + (mm — Way yx""? +--+ + 2aax + a4) 

(xe R). 
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Exercise 3 
(3 minutes) 


Exercise 4 
(5 minutes) 


Exercise 5 
(1 minute) 


Main Text 


Rule 1 


Rule 2 


Solution 2 


Hale lim & +hy'— x 
ho h 
fm nae ge) x7 2h? + opin} — x" 
= lim 2 
0 h 


(by the binomial theorem) 


= lin fe + nin = Deana tet wih 


(on dividing by h, since h # 0) 
= nxt 
(by the methods of Unit 7, Sequences and Limits 1). 

Consequently the derived function of 


xr— x" (xR) 


xr— nx"! (xe R) a 


Solution 3 

(i) t-— af (0) 

(i) b— f+ B'(0. 

The function 1-—> f(t) + g(t) is always differentiable, where fe (inter- 
section of domains of /’ and g’). a 
Solution 4 


We use the rules for addition and multiplication of limits of functions, 
which are similar to those for limits of sequences given in Unit 7, Sequences 
and Limits I. 


(i) When t € (domain of /") then 


wes (ure +h) - =a lim (| +h) - fo) 
hao hoo h 


» +h) - Le) 


= tim (a) x tim 


= aft), 
(ii) When ¢ € (intersection of domains of f” and g’), then 
im (f(t +h) + git + wi {fo +a 
ea 


oe fo) eae #0) 
he 


A(t + on LO) ‘g(t + h) — git) 
a h 


no 
=f) + g(t. a 
Solution 5 
This is just a formal statement of the result of Exercise 3 part (ii). 
(f+ a) =f) + 80 
(t intersection of domains of f’ and g’). |_| 


B 
8 
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Solution 2 


Solution 3 


Solution 4 


Solution 5 


Exercise 6 
(i) Differentiate 
x—> 10x5 + fxd + x (xe R). 


(ii) Differentiate x+—+2x + |x| (xe R) using the derived function 
calculated in Example 12. (Do not forget to determine the 
domain of the derived function, i.c. the subset of R for which the given 
function is differentiable.) a 


12.2.3. The Differentiation Operator 


We have seen in Section 12.2.1 that to every function / there corresponds a 
unique derived function, f", whose domain is a subset of the domain of /, 
(In some cases, this subset contains no elements, that is to say cannot be 
differentiated anywhere. Can you think of an example? In case you cannot, 
you will find one in Example 1 below.) 


That is, we have a rule which assigns to each member, f, of the set of real 
functions a unique function /’; the rule can be described by the word 
differentiate, and you may find it helpful to think of it as a machine which 
takes one function and turns it into another. as illustrated below; 


INPUT MACHINE OUTPUT 


—— DIFF! TIATE po 
f f 


Example | 


An example of a function which cannot be differentiated anywhere in its 
domain is a function whose domain consists of isolated numbers, for 
example ; 


fix-—ex (we {l,2,3,4}). 


The reason why this function cannot be differentiated is that the definition 
of a derivative at x involves taking the limit of 
A(x +h) = f(x) 


h 


as h tends to zero, But this means that there must be some interval 
{x — h.x + h] contained in the domain of f; otherwise we cannot apply 
our definition of a limit. But if x is one of the numbers 1, 2. 3 or 4, and 
0 <h| < 1, then x + h cannot be one of these numbers, so /(x + h) is 
not defined ; and hence the derivative at x does not exist. a 


Here we have a situation very similar to one in Unit 4, Finite Differences, 
where we also had a rule for turning one function into another. There we 
defined the difference operator : 

By: fi— [x f(x +h) — fd) (fe F) 
where F is the set of all real functions. 
Here we shall do a similar thing: we define an operator D whose effect on 
any function in its domain is to differentiate it: 

D:f— f'. 


It is called the differentiation operator. To complete the definition of the 
operator D we must specify its domain. We shall take this domain, as for 
A, to be the set of all real functions, F, Using the operator D we can write 
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Exercise 6 
(2 minutes) 


122.3 


Main Text 


Example 1 


Definition 1 


(continued on page 24) 
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Solution 12.2.2.6 Solution 12.2.2.6 


(i) Using the formula for the derivative of a polynomial function, the 
derived function of 


x-— > 10x5 + fx3 +x (xe R) 


x 50x* + gx? +1 (xe R). 


(ii) Let f:x-— 2x, and g;x—> |x| (xe R). Then f’:x+—+2 (x € R) and 
gix-——l (x >0) 


xeRAx #0* 
xt—+—1 (x <0) 


The intersection of the domains of " and g’ is equal to the domain of g’, 
So the derived function of x-—— 2x + |x| (x R) is 


xt—> f(x) + g(x) (xER A x #0), 


(x > 0) 
<—> 


(xeRAx #0). a 
1 (x <0) 


(continued from page 23) 
the results of section 12.2.2 as follows: 
Dix x") = x-— mx! 
Duf)=abf (aeR A feF) 
Dif+g)=Df+ Dg (feF AgeF). 


We have followed the same convention as in Unit 4, Finite Differences, 
writing Df as an abbreviation for D(f); and we have written af as an 
abbreviation for the function x+— af (x) (x € R). We also often drop the 
specification of the domain of the function being differentiated, as we have 
done in the first result above. 


Exercise | Exercise 1 
+ ‘ an ae a an (2 minutes) 
Using notation similar to that for the finite difference operator A,, it is 


usual to write D? for D» D, D* for D« D» D, and so on. 
(i) Evaluate D? (x-— > ax? + bx + c) 
(ii) Evaluate D? (x-—> x°*), a 


If f is any real function, the function D?f is called the second derived Main Text 
function of f, D*f is called the third derived function, and so on. Definition 2 
In the notation used in section 12.2.1, we would write: f” for D*/; f” or Notation 1 
Sf for D°f; f for D*f; f for D”f, and so on. 

One of the reasons for introducing the operator D is that it enables us to 

relate the properties of the differentiation mapping to those of other 

mappings we have met. 


* The symbol A is defined in Unit //, Logic J; it means “and”. 


Exercise 2 
Can you recognize the property of D expressed in the form: 


Df + g) = Df + Dg? a 


There are other binary operations on the domain F of D, and it is natural 
to ask whether there are any more useful morphisms about. For instance, 
what about multiplication? Suppose we asked you to guess? Would you 
guess 


D(f x g) = Df x Dg? 


If you think that this guess is reasonable, then we suggest you work 
Exercise 3; incidentally, Leibniz, one of the inventors of calculus, thought 
it was reasonable at one time. 

But we are really not tackling the problem correctly; the question we 
should ask first is: “Is D compatible with x2” (For the definition of 
compatibility, see Unit 3, Operations and Morphisms,) If the answer is 
“No”, then there is no morphism to be found. If the answer is “Yes”, we 
can then go on to see if we recognize the induced binary operation. We 
ask you to investigate this in Exercise 4, The consequences of Exercises 3 
and 4 are followed up in the next section, 


Exercise 3 
Show by an example (as simple as possible) that cases exist where 
D(f x g) # Df x Dg 


so that D is not a morphism of (F, x) to (F,, x). a 
Exercise 4 
Is D compatible with multiplication? | | 


Hints for Exercises 3 and 4 
(i) The definition of f x g is given in Unit J, Functions. 
(ii) Try fi: x-— x (xe R)). : 
in Exercise 3. 
and g,:x-— 1 (xe R), 
(iii) For Exercise 4 you may also find it helpful to try 


Fa (xeR) and g3:x-—2 (xe R). 
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Exercise 2 
(3 minutes) 


Exercise 3 
(3 minutes) 


Exercise 4 
(5 minutes) 


Solution 1 
(i) Let 
fix-— ax? + bx +e (xER) 
then 
Df:x-—> 2ax + b (xe R) 
and 
D?f:x-—> 2a (xe R). 
(ii) 
fix (xe R) 
Df:x-—> 3x? (xe R) 
D?f:x-—> 6x (xe R) 
D°f:x-—>6 (xe R). 


(Note the fact that we have included the somewhat trivial information 
(x € R) on every line, to emphasize that the domain stays the same through- 
out, It looks rather unnecessary here, but if we were differentiating more 
complicated functions we might well find that the domain changed in 
going from a function to its derived function.) i | 


Solution 2 


D is a morphism of (F, +) to (F,, +), where F, is the set of images of 
elements of F, and + denotes the addition of functions. D is many-one; 
for instance: 


D(x'—+ x?) = D(x-—+ x? + 2) 


and so D is a homomorphism. a 


Solution 3 
Using Hint (i), we try the functions: 
Ai xox (xe R), Dfyixe—l (xeR) 


and 

Byixt— +1 (xe R) Dgy:x-—0 (xe R); 
then 

fi % By: xt—x (xe R), Dif, xg): x1 (xe R), 
But 


Df, x Dg, =x—0 (xe R) 
# Df, x 8). a 


Solution 4 


Recalling the definition of compatibility in Unit 3, Operations and 
Morphisms, we know that D is compatible with multiplication if, whenever 
Df,) = D(fr) and D(g,) = Dig), 
then 
Dif x 81) = Df * 82). 
Now we cannot prove a formula true for all elements in a set simply by 
looking at some of the elements in the set, whereas we can prove it false 
by showing that it is false for some elements of the set. So the fact that 


Hint (iii) gives us particular examples should tell us that D is going to 
turn out not to be compatible with multiplication. 
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Solution 1 


Solution 2 


Solution 3 


Solution 4 


We could use the functions in Hints (ii) and (iii) as follows: 
D(f\) = D( fz) = (x-— 1) (xe R) 
Dig;) = Dig2) = (x-—>0)_ (xe R) 
Dif, * g1) = (x*— 1) (xe R) 
D(fa * 82) =(x— 2) (xe R) 
80 


Df, x 81) # D(fr x 82) 
and so D is not compatible with multiplication. 


An example such as this, which is used to show that a statement is false, 
is called a counter-example. a 


12.2.4 Differentiation of Products 


The result of the previous exercise has shown that differentiating the 
product of two functions is not going to be such an easy matter as differen- 
tiating their sum. In this section we shall investigate how to differentiate 
such a product. 

We have already seen (in section 12.2.2) the usefulness of having a rule 
for differentiating a product of two polynomial functions when one of 
them is a constant function, The rule is 


D(af) = abf. 


As long as we want to differentiate polynomial functions only, this is all 
we need to know about the derived function of a product, but to differen- 
tiate a function such as 


xr— xsinx (xe R) 

which is a product of the two functions 
xx (xe R) 

and 
x—sinx (xe R) 


neither of which is a constant, we need a more general rule for differentiat- 
ing products of functions. 

Let us denote the two functions whose product we wish to differentiate 
by fand g, and their product by k, so that 


Sxgak 
We assume for simplicity that the functions have domain R. To illustrate 
the product relationship and the subsequent argument, we suppose that 
the elements of the domain of f, g and k are times (represented by a variable 


1), and that the images of ¢ under the functions f, g and k are the sides and 
area of a rectangle: 
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Main Text 


Our problem is to evaluate k'(t), the rate at which the area changes with 
time. By the definition of a derivative, this is 
ki) = tim kit + h) — k(t) 

0 h 
that is 

eth 0 

k(t) = lim St + Matt + h) = flogte) 
hd, h 


The numerator is the difference in area of two rectangles: 


git+h) g(t) 


pemmenth EE 1) eens 


and is therefore equal to the area of the shaded L-shaped strip in the above 
diagram. This strip can be treated as the sum of three rectangles, as shown 
below 


(Remember that A, f(t) = f(t + h) — £(0),) 
Adding the areas of the rectangles and substituting in Equation (1), we 
get 


k(t) = lim LOA + BOA SO + Apel, J) 


nv h 


ASO) pir AOA S(O) 
h h0 h 


= f(ptim 229 5. gym 
ok ey 


by the rules for limits given in Unit 7, Sequences and Limits I (since f(t) 
and g(t) do not involve h). Hence 


Antti, 

k(t) = fide’) + sf) + lin SEM, 

by the definition of a derivative. 

The last term on the right-hand side can be written 
tim AO)» tim Ayate) = s') x 0 = 0, 
no oh hao 

since both these limits exist and 


lim A,gir) = lim [g(t + h) — g(t)] = 0, 
hwo h=0 
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Equation (1) 


Equation (2) 


Equation (3) 
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ifg is a continuous function.* (In fact, this is not a new assumption since 
we have already assumed that g is differentiable, and it can be proved 
without much difficulty that every differentiable function is continuous: 
a result which is intuitively obvious if we regard differentiability in terms 
of being able to draw tangents.) 


Equation (3), therefore, becomes 
Ky) = flog) + gO f(o). Equation (4) 


That is, the rate of change of the area of the rectangle is the sum of two 
terms: the rate at which area is being added because the length of the 
side of original length f(r) is increasing, plus the rate at which area is 
being added because the length of the side of original length g(t) is increas- 
ing. 

Equation (4) is the rule for differentiating products. We have thought of 
tas a time in deriving this rule, but the rectangle illustration can be 
droppedt and the mathematical argument applies to any product of 
functions with domain R or some subset of R. The Product Rule can, of 
course, also be written in terms of the functions themselves instead of 
images; it then takes the form: 


(fey = Se’ + af Product Rule 
or, in terms of D: 
D(Jg) = {Dg + gbf. 


This can be regarded as a generalization of the equation D(af’) = aDf, to 
which it reduces when g is a constant function. 


Exercise 1 Exercise 1 
. : (S minutes) 

What can you say about the domain of (z)' in terms of the domain of /” 

and g’? a 

Exercise 2 Exercise 2 


(2 minutes) 
Verify the product rule by considering the product of the two functions: 


Sixt —a 2 1 (xe R) 


gix-—tx- 1 (xe R). a 


* Notice that we have to assume that g is continuous in order to be sure that tim att +h) = eft), 
Ifg is not continuous, then this limit may exist and be different from g(t): see examples in 
Unit 7, Sequences and Limits 1. 

+ Note that we could write Equation (1) in the form : 


, LC + hgit + A) — firdgit + h) + fUndgte + A) — Fledett) 
ete) = hima a es ett) 


ie. 
(ey = tim Seflegte + ud + f(OD,800) 
oo 


which again leads us to Equation (4). 


Solution 1 


The domain of the function ( fg)’ = fg’ + gf" is the intersection of the 
domains of f, f’, g and g’. But since the domain of f’ is contained in the 
domain of f, and the domain of g’ is contained in that of g, we can simplify 
this: the required domain is the intersection of the domains of f’ and g’. 


a 
Solution 2 
Jig: x 2x? — x -1 (xe R) 
(fg): x 4x — 1 (xe R) 
S(x)g'(x) + x) f(x) = (2x + Ng'(x) + (x — DSO) 
= (2x + I)1 +(x — 12 
=4x-1. 
Thus 
fe’ + gf'):x— 4x - 1 (xe R), 
and the result is verified. a 


12.2.5 Differentiation of Composite Functions 


In the last two sections we have shown that D is a morphism with respect 
to addition of functions in its domain and codomain, but that no morphism 
exists for multiplication. We shall deal with division later, and subtraction 
follows directly from addition to give 


D(f — g) = Dif +(—g)) where —g:x-—+ —g(x) 
= Df + D(—g) 
= Df — Dg 
using the rule Diag) = aDg, with a = —1. 


The last of the binary operations on functions that we defined in Unit /, 
Functions was the operation of composition 


So gix-— f(g(x)). 
We approach the investigation of composition in the same way as we 
tackled that of multiplication. 
Exercise 1 
Show by as simple an example as possible, that cases exist where 
D( fog) # Df Dg, 


so that D is not a morphism with respect to composition of the functions 
in its domain and codomain, L 
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Solution 1 


Solution 2 


12.2.5 


Main Text 


Exercise 1 
(5 minutes) 
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Exercise 2 Exercise 2 
? (5 minutes) 
Is D compatible with composition? 


So we must try to find a rule for differentiating the composite of two func- Maia Teat 

tions f and g. We write : 
Pega ke ie fog(x) = f(g(x)) = k(x). 

The relation between these functions is illustrated below: 


fog=k 


It can also be illustrated in this way : 


axis g(x)-axis k(x) axis 


H{ g(x)" k(x) 


To determine whether fog is differentiable we consider the formula 
defining a derivative: 


k(x) = lim leva t hh iki) = Kx) 
ho h 


The limit on the right can be interpreted using a diagram similar to the 
one below: 


x-axis k ()-axis 


We see that 
K(x + h) — k(x) = 2102 
h P,P,” 
This may remind you of the scale factor of k at x, which we defined in 
Unit 2, Errors and Accuracy. Here we shall refer to it as the magnification Definition 1 


produced by the function k when it maps the segment P,P; (i.e. the interval 
[x, x + h]) to the segment Q,Q, (ie. the interval [k(x), k(x + h)]). (continued on page 32) 


3 


Solution I 
Almost any example will do; here is one: 
Let 
gixr—— 1 (xe R), gx 70 (xe R); 
fix-—rx (xeR), fix (xe R); 
fog:x-— 1 (xeER); 
(fogy:x-—40 (xeR),  f'og'sxr— 1 (xR). 
So Di f°) # Df» Dg in this case. 


Solution 2 


According to the definition of compatibility given in Unit 3, Operations 
and Morphisms, D is compatible with composition if, whenever 


Df, = Df, and Dg, = Dg. 
then 
D(fi 281) = Afr° 82): 


It can be shown by counter-examples that this condition is not satisfied ; 
for example: 


fuixroxt +1 (ER, Bx x (xe R); 
fax x? (xeR, — gp:x-— x +1 (xER); 
fregyix—x? +1 (xe R) 
faogyix-—(x + 1? (xER) 
Dif, og )ix-—> 2x (xe R), 
D( fz °'@2):X'— 2x + 2 (xE R). 
a 


(continued from page 31) 


Our problem is to express the derived function of fg in terms of the 
functions f and g: in terms of diagrams, it is to combine the last two 
diagrams. We can combine the two diagrams in the following way: 


x-axis g(x)-axis k(x)-axis. 


A(g(x+h)) 
t(g(x)) 


Instead of performing the mapping k in one stage we perform it in two 
stages: we perform first g and then f, Thus the segment P,P; is mapped 
first to the segment R, R; and then to the segment Q,Q>. The magnifica- 
tions produced in these two steps are 


RRs yg 2103 
PP, °™ RRs 
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Solution 1 


Solution 2 


respectively, and the overall magnification is the product of these two: 
O10: RRs, 010 
P,P, PLP, R,Ry 

We have 


Pity 231°) Say 
Eiko _ yp 
in Ps 


(x). 


For the magnification in the first stage of the two-stage process we have 


MRR, 
iene 

Finally, the magnification produced by the functioti f has the limit 
5 212 “ip(x\)e 
tim RRs FRx)". 


This time the derivative has to be computed at g(x) and not at x, since 
the point R, corresponds to the number g(x). 


If we put these three limits together we obtain from Equation (1) the 
Composite Function Rule 


k(x) = g'(x) « f'(g(x)), 
In function notation, we have: 

(fea! =(f'og) xg 
It is often called the chain rule, or “function of a function” rule, since a 
composite function is, in a sense, a function of a function. 
Example | 
Differentiate 

kix-—+(2x + 1)? (xe R). 


(This could be done by expanding the brackets, but we shall apply the 
above rule.) 
If we let 
gixe— 2x +1 (xe R) 
and 
Sixt—> x? (xe R) 
then k = fog, 
We can now use the composite function rule, i.e. 
Kk = (foog) x g’ 


(x+——* 2x) » (x—4 2x + 1)] x (x+-—+ 2) 


= X+—*2(2x + 1) x 2 
=xr—A2x+1) (xe R) a 


Exercise 3 
If 
Six-—>x? +1 (xeER) 
and 
gix-— x? —1 (xe R), 
find fog. 
Differentiate fg directly and also using the composite function rule. 
Compare the two results. a 


* This step really covers up the difficult part of the verification, which depends on the 
continuity of g. A formal proof can be found in Calculus by T. M. Apostol (see Bibliography). 
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Composite Function Rule 


Example 1 


Exercise 3 
(3 minutes) 
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Exercise 4* Exercise 4 
(G3 minutes) 

Differentiate : 

(i) kyixr— (x + 18 (xe R) 

(ii) kyix-— (3x + 18 (xe R). a 

Exercise 5 Exercise 5 
(3 minutes) 

Differentiate: 

(i) ky:x-—+(7x + 3)* (xe R) 

(ii) ky:x—+ (2x? + 3x + 2)? (xe R), a 

12.2.6 Differentiation of Quotients 12.2.6 

In this section and the next we complete the list of rules of differentiation Main Text 


by deducing from the product rule a rule for differentiating the quotient 
of two functions, and deducing from the composite function rule a rule 
for differentiating inverse functions. You are not expected to be able 
to deduce these rules but you are expected to know the results and how 
to use them. 

We begin by finding the derived function of the reciprocal function, 
which we denote here by the letter r, so that 


reel (xERA x #0); 
then 
r(x) = lim [peo 
hoo h 


Prey | af 1 ins 
= tim Rea - | (by definition of r) 


dig ifz= eae) 
 noh\ (x + A)x 


ries las + hx 


= lim pol (the cancellation of the h’s is valid since 
nao\(x + Ax} yy 0) 


Taking the limit as h +0 we obtain (since x # 0) 


r(x) = 
ie. 
‘4 
D{ x-—-| = 
x 
or 
r=-rxr 


the domain being the set of non-zero real numbers throughout, 


* You may like to use the diagrams at the back of this text. 
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Note Notation 
(i) We shall denote the product function r x r by r?. 
Thus 
1? x r(x) x 1x) = (r(x). 
(ii) If 
Bix——> u(x), (xe R) 
then we shall denote the function: 
1 


ees) 


(xe R A v(x) #0) 
1 
by -. 
y # 
We can now apply our last result to the differentiation of the quotient 
of two functions, If two functions u and v have as domain the same subset 


of R, and codomain R, then we define their quotient, or u/v, by 


UX 
x (x € domain of wand v, A v(x) 4 0). 
v Ma 
The formula for differentiating such functions is a useful one. It may 
help you to remember the rule if you work it out for yourself: so the rest 
of the derivation of this formula is set as an exercise. 


Exercise 1 Exercise 1 
: ; 3 : F (5 minutes) 
Denoting the reciprocal function by ras in the text, we can write 


u iy 
~=uUX—=uU x (rev), 
v v 

Using the following results: 


(i) the derived function of r is —r?. 
(ii) the product rule, i.e. 


(Sf xeyaf xgtsfxes 
(iii) the composite function rule, i.e. 


(fog) =(S' 28) x 8’, 


wW id henry 

express (: in terms of uw’, v', wand v. a 

The result of the last exercise gives the Quotient Rule; Main Text 
(' uli xo 5 ux ov" a 
e v = 


which means that 


F u(x) 
if W(x) = ee) 
then 
i u(x) x v(x) — u(x) x v(x) 
w(x) = —————____—_—, 


(v(x) 


é ul’. % A * i a 
The domain of (‘ is the intersection of the domain of w', the domain 


of v', and the set {x:x  R, v(x) 4 0}. (continued on page 40) 
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Solution 12.2,5.4 
(i) Let 
fiix-—xt (xe R) 


Bix x+1 (xeR). 
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Solution 12.2.5.4 


Adopting the schematic form of the previous exercise, we now have: 


(See illustration on opposite page) 

(ii) We can repeat the process with 
fx (xe R) 
B.:X+—>3x+1 (xeER), 

Alternatively, we notice that 
a(x) = kx) 

so that we can write 
ky = ky o(x+—> 3x) 

whence 
ky = [ky 0 (xt 3x)] x (x3) 

= [xt 5(3x + 1)*] x (x3) 
= x 15(3x + 1)* (xe R). 

In general, if 

k(x) = ky(ax) 


so that 

ky = ky 0 (x—* ax) 
we have 

Ky = [ky 0 (x ax)] x (xa) 
so that 


k3(x) = ak\(ax), 
Solution 12.2.5.5 
(i) x-—+ 35(7x + 3)* (xe R) 


(ii) x—+ (8x + 6)(2x? + 3x + 2) (xe R). 


Solution 1 


(‘) = [u x (rov)]' =u x (rev) +u x (rev) 


=u’ x (rov) +.u x [(r'ov) x 0] 


=u x (rev) +u x (—r ev) x wv. 


=i 
Now rev =—and —r? ov = —-, so that 
v 
u\’ ow ux 
v - v v 
ux 
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a 
Solution 1 
by (ii) 
by (iii) 
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plbtX)S 4x 


plbtx)G<4x 


Exercise 2 


Differentiate: 


a (xe RA x #0) 


where k is a positive integer. 


Exercise 3 


Differentiate: 


@) wyixr-— 


pee i ee 
2x = 
(ii) wy (xeRAX # —}). 


Can you see why the two results are the same? 
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Exercise 2 
(3 minutes) 


Exercise 3 
(3 minutes) 


12.2.7 Differentiation of Inverse Functions 


The last rule of differentiation we shall consider is the rule for differentiat- 
ing the inverse of a function. The inverse of a one-one function, as defined 
in Unit J, Functions, is a function that reverses the effect of the original 
one. For example, the inverse of the function “double the number” is 
the function “halve the number”, since if we double a number and halve 
the result we come back to the original number. We saw in Unit /, that 
a function has an inverse if and only if it is one-one ; that is, ifeach element 
in the domain has a different image. 


It follows that if g is the inverse of f, then 
Bo fix-—x (xe domain of f) 
and 


fogix-—+x (x edomain of g). 


We can use the composite rule to find the derivative of g in terms of 
Seg =xr—x 

therefore 
(fog! =4-— 1 

and by the composite rule, 
(fog) =(S"e8) xg" 

which gives us the Inverse Function Rule: 

Lads! 1 

fee fee 

Notice that we have to take care with the domain of g’: it is that part 


of the domain of g for which g is differentiable and for which f’ ° g(x) 
does not vanish.* 


Exercise 1 
Differentiate 
ith och (xe R*) 


where m is any positive integer. Compare your result with our previous 
result, namely 


(x-—+ x"! = (xe nx") (xERA x #0) 


where n is an integer. i | 


* We also assume, here and everywhere else where we differentiate a composite function, 
that f’ eg is meaningful; ic, that the domain of /" contains the image of the domain of g. 
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Main Text 


Inverse Function Rule 


Exercise 1 
(2 minutes) 


FM 12. 


Solution 12.2.6.2 Solution 12.2.6.2 
There are many ways of tackling this: we shall use the quotient rule with 
uxt (xe RA x #0) 
vixt—> xt (xeR A x #0), 
We already know that 
piixt—okx'"! (xe R Ax #0) 
so that 


u(x) * v(x) 


u(x) x v(x) _ Ox xt — 1 x kat! 
7 = ak 


v x 
Saket 


—k 


Considering the form —kx~*~', and remembering that we were differ- 
entiating x-—>x"* (xe R A x # 0), we can see that we can now write 


D(x x") = x-— nx"! 


for any integral value of n, 
(the domain is the set of non-zero real numbers when nis negative). 


Solution 12.2.6.3 Solution 12,2.6.3 
The answer to both parts is 


4 
axa (xeR Ax # —}). 


The two results are the same because w,; — w) is a constant function: 


ax=1 2 
MAES Soe 2x 
(Qx +1)-2 2 
EE 
2x +1 2x +1 


so that 
W2 — Wp = x-—— 1 (xeRA x #—4) 
whence 
(wa — wy)’ = wy — Wy = (x 1)! = x0 
ie. 


w(x) — wy(x) = 0 
or 


w(x) = w} (x). a 


Solution 1 Solution 1 
Call the given function g. Then the inverse, f, of g is given by 

fxt—+ x" (ER), 
so that 


fix mx"! (xe R*). 


The inverse function rule then gives 
8X) = timp 


1 1 
Se a, 
ml Vim 


gx yim 
m 


We have shown that the result: 


the derivative of x-—+ x" is x-—+nx"~ ', where ne Z, also holds when 
nis the reciprocal of a positive integer. a 


12.3 STANDARD DERIVED FUNCTIONS 
12.3.0 Introduction 


The rules of differentiation obtained in the last section are sufficient 
for differentiating a rational function, that is, any function formed from 
polynomials by arithmetic operations (addition, multiplication, division) 
and by composition, and also for differentiating the inverse (if it exists) 
of such a function, This class of functions is quite extensive, but it does 
not include some very useful functions such as the trigonometric, exponen- 
tial and logarithm functions. In the present section we shall fill some of 
the gaps. In each case it is the result, not the method of obtaining it 
that you need to know, although we shall set some of the results as 
exercises so that you can get practice at applying the rules previously 
demonstrated. These standard derived functions are often called standard 
forms. In this section they are printed in red and for convenience they 
are also collected together, with a list of the rules of differentiation, at 
the end of this correspondence text, just before the appendices, 


12.3.1 Trigonometric Functions 


Denoting the derived function of the sine function by sin’, we have: 


ae . sin(x +h) ~ sinx 
sin’ x = lim ——____—_. 
h=0 h 
The evaluation of this limit involves some minor technicalities, which are 
not an essential part of the technique of differentiation and have therefore 
been excluded from this Foundation Course. (If you would like to read 
how the limit is evaluated, turn to Appendix II.) What does concern us 
here is the actual value of the limit: it turns out to be simply cos x, so 
that the formula for differentiating sine is 


sin’'x =cosx (xe R) 
or simply 
sin’ = cos. 
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1234 
Maes 


We can also differentiate the function cos (i.e. x-— cos x (x € R)) using 
the limit definition of the derivative, but there is a possibly more elegant 
method which uses the identity : 


cos x = sin 5 = | (xe R). 


Exercise 1 


Use the identity 


cos x = sin (5 - | (xe R), 


the composite function rule, and the fact that sin’ = cos, to find the 
derivative of cos. a 


Exercise 2 
Differentiate the tan function. To do this you may need to use: 
i sin x 
(i) tan x = ——, 
cos x 
i) the quotient rule, 
(iii) the trigonometric identity cos? x + sin? x = 1. 


What is the domain of the tan function? What is the domain of its derived 
function? a 


Exercise 3 


Differentiate: 


4 1 
(i) see = sor 


1 


ii) cot : 
( ) tan 


eat 1 
sec = —, 
(ili) cose mn 


The domain of the derived function is, in each case, the same as the domain 
of the original function. In cases (i) and (iii) the domain is R except for 
those numbers x for which cos x and sin x (respectively) are zero. In 
the case of (ii) the domain is R except for those numbers x for which 
tan x is undefined and for which tan x is zero. It is usual, however, to 
define cot x to be zero for those x for which tan x is undefined. a 


Exercise 4 


Find D? (sin), D® (sin), D* (sin), D® (sin), Generalize your results by writing 
down a formula for D" (sin) that holds for every positive integer n. a 


Exercise 5 (Optional) 


Motions in which the position x of an object at time ris given by a formula 
such as 


X — Xo = asin b(t — to), 
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(See RB10) 


Exercise | 
(3 minutes) 


Exercise 2 
(3 minutes) 


(See RB1O) 


Exercise 3 
(3 minutes) 


Exercise 4 
(3 minutes) 


Exercise 5 
(4 minutes) 
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(where x9, fo, a and b are fixed numbers) are frequent in physics and 

mechanics. They are called simple harmonic motions, For example, any Definition 1 
point on a clock pendulum or on a vibrating piano string executes a % 
motion that is approximately simple harmonic. Calculate the velocity 

and also the acceleration (the rate of change of velocity) for a moving Definition 2 
body whose position is given by the above formula, and find the function i 


(position)— (acceleration). 


The form of this function is characteristic of simple harmonic motion. 1 
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Solution 1 Solution 1 
Let 

gue —x (xeR) 

S:x-—>sinx (xe R), 
Then using (f > g)' = (f’eg) x g', we have 


cos’ = [cos - (x = | x (x-—> = 1) 


nia 


F ud 
cos' x = —cos |= — x]. 


This result can be simplified by noting that cos (§ — x} = sin x (xe R), 
so that 
cos’ = —sin. a 
Solution 2 ‘Solution 2 


Starting with 


x ‘ 
———— (quotient rule) 


cos? x + sin? x 


cos* x 


(cos? x + sin? x = 1) 


= sec? x sec x = ——], 
c 


This result can be written as 
tan’ = sec? 


The domain of tan is R except for 


=3 cet oes LOR Paka 
> BBM Ghee 
The derived function has the same domain. a 
Solution 3 Solution 3 


The rule for differentiating quotients of the form: 


u(x) = ws) (xe R, where w(x) 4 0) 
w(x) 


_ w(x)e'(x) — vfx)w’(x) 


u(x) (win)? 
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If v(x) = 1, e'(x) = 0, so that the quotient rule becomes : 


1 
(i) When u(x) = ——, then w(x) = cos x and w(x) = —sin x, so that 
cos x 


; sin x 
u(x) = —s— = sec x tanx 
cos* x 
ie. 
sec’ = sec tan 


(ii) When u(x) = — then w(x) = tan x and w(x) = sec? x, so that 


vite) = Bo aie y cont Py = es 
tan’ x cos*x sin’ x sin’ 

ie, 

cot’ = —cosec? 
(iii) u(x) = ae w(x) = sin x; and so w(x) = cos x. Thus 

u(x) = ees = —cosec x cot x 
sin’ x 
ie. 

cosec’ = —cosec cot. 1] 
Solution 4 Solution 4 

Dsin = cos 

D* sin = Dcos = —sin 

D* sin = D(D* sin) = D(—sin) = —cos 

D* sin = D(D* sin) = D(—cos) = sin 

D* sin = D(D* sin) = D(sin) = cos. 
Suppose n is an odd integer i.e, we can write 

n=2k+1 where ke {0,1,2,...}, 
then 

D**" sin = (= 1) cos. 
If nis an even integer i.e. we can write 

n=2k, (ke{1,2,3,...}) 
then 

D**sin = (—1)}'sin. a 
Solution 5 Solution 5 


The position of the object at time 1 is 
X = Xp + asin b(t — to). 
The velocity-function of the object is D(t-— x), 
In the above expression, a, b, xo and {» are all fixed numbers (constants). 
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To find the velocity, we must find 
D(t-— asin b(t — f)). 


Let git b(t — to), so that g’:t-——+h and f = sin, so that f’ = cos. 
Then 


(fog) = (t-—+sin b(t — to)! 

Now 

((f'eg) x g'](t) = cos bit — ty) x b 

= bcos b(t — to). 

It follows that 

D(t-—+ sin b(t — to)) = t-—+ bcos b(t — to), 
and hence that 

D(t-— asin b(t — to)) = t-—> ab cos b(t — to), 
so the velocity at time ¢ is ab cos b(t — to). 
Similarly the acceleration function is D*(t-—+ x); so the acceleration 


at time ¢ is —b?a sin b(t — to). The function (position)—+ (acceleration) 
is therefore 


x-—+ —b?a sin b(t — to), 


or substituting for ¢ from the formula for position: 


x—+ —b*(x — x9) (xe R). a 
12.3.2 Exponential and Logarithmic Functions 12.3.2 
Denoting the derived function of the exponential function by exp’, we Main Text 


have from the definition of a derivative: 


oe tim SP aa — exp (x) (xeR) 


= lim os by the exponential theorem (see Unit 7, 
Lae Sequences and Limits 1) 
. e(e— 1) 

= lim ———— 
no 

= Ce* . 

= Cexpx 


where C denotes 


It is shown in Appendix III that this limit exists and C = 1, and hence 
exp’ = exp. ‘Equation (1) 
Thus the derived function of exp is the exponential function itself, 


This is a remarkable result, but it is no more than might be expected 
from our introduction to the exponential function via the population 
growth example which was used in Unit 7, Sequences and Limits I. In 
the model used there, the change in the world’s population in a given 
interval of time was assumed to be proportional to the population itself. 


48 


Since differentiation gives the instantaneous rate of change, it is perhaps 
not so surprising that in differentiating exp we find that the derived 
function equals exp; we are, after all, considering a quantity (the world’s 
population) whose instantaneous rate of change was (apart from a con- 
stant of proportionality) equal to the quantity itself. 

The exponential function crops up frequently in applied mathematics 
because applied mathematics problems (such as the population growth 
problem) frequently require a function whose derived functions are 
related to the function itself in a way that can be reduced to Equation (1) 
by suitable manipulations. 


Exercise 1 
Use the rule for differentiating inverse functions to show that 


1 
ln (ve R*). a 


x 


Exerci, 


Show that, for any real function /, with codomain R*, 
r 
(Ino fy ==. 
J f a 
Exercise 3 


A useful technique, called logarithmic differentiation, for differentiating 
a complicated product, f, is to form the composite function, Ine /, find 
its derivative and then obtain f’ by using the result in Exercise 2, 

(i) Apply this method to the function 


1 +x 
I + 2x, 


v—e( ) (we Rand x 4 ~—}. 


(ii) Use this method to differentiate x-—+ x* (xe R"), where x is any 
real number. a 
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Exercise | 
(3 minutes) 


Exercise 2 
(3 minutes) 


Exercise 3 
(5 minutes) 
Definition 1 
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Solution 1 Solution 1 
Since In is the inverse function of exp, the inverse function rule gives 


1 1 1 


lS exp’ (In x) = exp (In x) ce 


+ 
- (xe R*). a 
Solution 2 


Solution 2 
The composite function rule gives 


(ins fy =(In'ef) x f’ = f . 


Solution 3 Solution 3 
(i) Let 


1+), 
fo) = ef) 


then 
In f(x) = x + In(1 + x) — In(1 + 2x) 


— pease on differentiating 
fe) “i+x 1+2x 


a Fa) = foo 1 + : Z ] 


T+x D+ 2x 
te 
=o} 


( euse al. 
1+x 14+ 2x 
(ii) Let 
forex ‘te Ri) 
f(x) = x* 
In f(x) = In(x*) = aln x, 
Differentiating gives 


orse\ ea Le (9c) om eemepmnd 
SI')= aS) = :* = ax’ 


S':x—raxt' (xe Rt), wee 
We have therefore shown that the function 

Sixx (xe R*) 
has derived function 

f':x-— ax"! (xe R*) 


where a is any real number. i | 


50 


12.3.3 Technique of Differentiation 


This section is devoted to some exercises to help you learn to apply the 
rules of differentiation. If you get into difficulties with any of them do 
not spend a lot of time on it, but look up the solution and compare it with 
your own attempt, to see where you went wrong, The results of some of 
these exercises will be useful in Unit /5, Differentiation I, There is nothing 
essentially new in this section, so if you are pressed for time you can leave 
it for the time being. 


Exercise 1 


Differentiate ; 
(i) fox tt (xeR*); 


(ii) f:x-—> x exp(—x) (xe R); 


ae (x — 3)8 : 
(ili) Es i => (xe RA x> 3), | 
Exercise 2 

(i) If 

Sir—aal# + | (reR*) 
mm 
find S‘(r). 
(ii) If 


fix /x? $y? (xe R) 


where y is some real number (not zero), find f’. 


(iii) If 
P(x) = (x — 1)? x (x + 27 
defines the function P with domain R, find P(x). a 
Exercise 3 


Differentiate the function r defined by 
= U) 
10) = seco + $= tan (a [> ‘l) 
1 2 < 


where d, s, v,, and v3 are positive real numbers. a 
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Exercise 1 


(I minute) 


(2 minutes) 


(2 minutes) 


Exercise 2 


(1 minute) 


(2 minutes) 


(2 minutes) 


Exercise 3 
(3 minutes) 
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Solution 1 Solution 1 


(i) 


(ii) 


(iii) 


Standard derived functions give 
(x-—> x)" = (x 1), 
and 
(xe x) = (x — x); 


it follows by the addition rule that 


1 
[»—+ 4) = (st - | (ve R*). 
x | 
The given function has the product form 
S=sh 
where 
g(x) = x (xe R) 


h(x) = exp(—x)> (xe R), 


Moreover, h(x) is of the form exp (k(x)) where 
k(x) = —x (xe R). 


By the “function of a function” rule, the derived function of h is 
given by 


h(x) = exp’ (k(x) x k(x) = exp(—x)(—1) 
= —exp(—x). 
By the product rule, the derived function of f is therefore given by: 
f(x) = gh(x) + gxVh'(x) 
= lexp(—x) + x(—exp(—x)) 
= (1 — x)exp(—x) (x R). 
We could differentiate directly using the quotient rule, but it is easier 
to use logarithmic differentiation (see Exercise 12.3.2.3), That is, we 
write down the formula for f(x) as usual 
(x — 3)8 
(x — 2)* 


L(x) = 
but before differentiating we take natural logarithms, obtaining 
In f(x) = 3In(x — 3) — 4In(x — 2) 
and then, by the “function of a function” rule, 


In’ (f(x) f(x) = 3 In’ (x = 3) — 41n'(x — 2) 


(since (x +x — 3) =x +1 and (x +x — 2) =x-—— 1), 


Since In’ (u) = ‘ this simplifies to 


(xeRAx>3). @ 
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Solution 2 Solution 2 


i 1000! 
(i) S(r) = 2x( + = 
Lie 
Standard derived functions and the rule for constant factors give 


(r-—> r?y = (r-—> 2h), 


and 


therefore the addition rule and the rule for multiplication by a 
constant give 


Sin) = 2x(2r - wr lire R*). 
7am 


(ii) The function can be expressed as a composition: 

S(x) = gth(x)) 

where 
h(x) = x7 + y? (xe R) 

and 
glu) =u (we R”). 

The derived functions of h and g are such that 
h(x) = 2x +0 (by the sum rule) 


g'(u) = gu-'/? = —.. 
g h can 


The rule for differentiating composite functions gives 
I(x) = g'(h(x)) = h(x) 


(Since y is not zero, the denominator of /"(x) is never zero. 
(iii) The given function is of the form 
P=0S 
with 
Q(x) =(x = 1)? (xe R) 
Six) = (x + 2)? (ve R). 


These functions are compositions; for example 


Q=FeG 

where 
Flu) = uw? (we R) 
G(x)=x-1 (xe R). 
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The rule for differentiating composite functions gives 
Q(x) = F(G(x)) x G(x) 
= 2G(x) x 1 
=2(x- 1). 
Similarly, we have 
S'(x) = 2(x + 2). 
The product rule therefore gives 
P(x) = Q(x)S(x) + Q(x)S'(x) 
= Ax — I(x + 2)? + (x — 1)?2(x + 2) 


= Ax — I(x + 2)(2x + Ie R)- a 


Solution 3 
We know from our list of standard forms that 
sec’ = sec 0 tan 0 
and 
tan’ 0 = sec? 0. 
It follows that 
2d —2d sec? 0 


(0) = — sec O tan 0 + 
oy U3 


which defines 1’, since the domain is already given as [0 qf. a 


12.4 SUMMARIES 
12.4.0 Summary of Unit 


We began the unit with revision of the idea of average rate of change. 
We considered distance and the average rate of change of distance, which 
we called the average velocity. The average velocity was shown to be 
inadequate as a measure of the instantaneous velocity, and a difficulty 
in this last definition was overcome by means of a limit process. We 
thus arrived at the definition of the derivative of a function at an element 
in its domain, and this was generalized to obtain the derived function. 
In the rest of the unit we developed techniques for differentiation, We 
obtained rules for differentiation of functions in the forms: 


f 


f+efx Bote g, and the inverse of a one-one function /. 


We then applied these rules to polynomial, trigonometric, exponential 
and logarithmic functions. 
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Solution 3 


4 
R40 


Summary 
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12.4.1 Summary of Rules and Formulas for Differentiation 141 


Rules for differentiation 


| Ifg =af, then g’ = af". 
2 Ih=/ +g, then h’ = f’ + g' (sum rule), 
3 Ifh = fg, then h' = f'g + fg' (product rule). 


f if 


4 Ifh ==, then h’ = (quotient rule), 
& 
5 Ifh= fog, then h' = (fg) x g' (composite function rule), 
6 If fis one-one and g is the inverse of f, then f’ = paye 
Standard derived functions 
fi f Domain of {* Remarks 
xan xiang t R meZ* 
L— XN xin Rwithou0 meZ 
Lie x——b nxt R* meR 
x'—rexpx x—expx R 
1 
x—Inx x R’ 
x 
xh—sinx xt—* cos x R 
xi—+cosx 9 xi — sin x R 


Some further derived functions (obtainable from the above) 


t f" Domain of f’ 
xe— lan x x—+ sec*'x. R excluding 
& 3m 
+5 tye 
xe—* sec x xX+— sec x tan x R excluding 


x-— cot x x—> — cosec? x 


x+—*cosec x Xt— — cosec x cot x 


Q bay 2m, 2. 
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12.5 APPENDICES (Not part of the course) 
12.5.1 Appendix I: The Leibniz Notation 


Probably the most widely used notation for calculus is the one invented 
by Leibniz. In this notation, if x is a variable representing an element in 
the domain of a function, f, and y is another variable whose value is 
related to that of x by 


y= f(x) (xedomain of f), 


then we use the symbol w to stand for /"(x), That is, we define 
dx 
ery cl i 
an f(x) (xedomain of /'). 
BES 


The advantage of the Leibniz notation is its conciseness; for example, 
the formula giving the derived function of x-—> x? (x € R) can be written 
d(x") = 2% 
dx 
instead of 


(xt—+ x?) = (xe-—> 2). 


dy . oe 
The process of calculating 2. when y is defined as an expression involving 
d 


x (for example the x? in the above example), is called differentiating this 
expression with respect to x. Formally, we may say that to differentiate 
y with respect to x is to calculate f(x) where f is the function defined by 


fi:x-—+*y, 
The disadvantage of the Leibniz notation is that it contains some traps 


for the beginner, arising from the difficulty of assigning an independent 
meaning to the symbols dx and dy when they occur separately (rather 


than locked together in the combination @) Since the principles of 


calculus can be understood without using Leibniz notation, it is not 
treated as an essential part of this Foundation Course. 


Thus the rules for differentiating sums and products in the Leibniz nota- 
tion become 


du +v)_ du | dv 


dx ~ dx * dx’ 

d(cu) du, 

—— = c—~ if isa constant 
dx dx 


(i.e. if x-— ¢ is a constant function), 
a) Ae 
idx) da de 


To express the rule for differentiating composite functions in this notation, 
let x, y and z be variables related by 


Y= g(x) (x domain of g) 
z= fly) (y edomain of /') 
so that 
x) = : domain of g’) 
aL ays (xe domain of g’), 


Cee he 
Peo) = f'0) =F 


(yedomain of f"), 
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Then the rule for differentiating a function h defined by 
h(x) = f(g(x)) (x domain of h) 


h(x) = f'(g(x)) * g(x) (xe domain of h’) 
which takes the simple form 

ee 

dx dy 
This rule is called the chain rule and is easy to remember because one 
can think of dy as “cancelling out” from the expression on the right. 


a 
dx’ 


The rule for differentiating inverse functions also takes a convenient 
form in this notation: it is 


The proof is similar to the proof of the chain rule. We shall not go into 
details here. 

There is a very good discussion of the Leibniz notation on page 171 of 
T. M. Apostol, Calculus (see Bibliography). (If you have the 1961 edition 
of Apostol, see page 127.) 


12.5.2 Appendix II: Differentiation of sin and cos 12.5.2 


‘ HS oe pair 4 Appendix I 
To differentiate sin we use the definition of a derivative, obtaining 


sin’ (x) = lim 
bas 


(eee + h) — sin ‘) 
0 


h 
= Ii sin x cosh rs cosxsinh — sin x 
“heel hi h nl 


Collecting together the parts that are proportional to cos x and sin x, we 
have: 


n=O 


ce sinh ‘ . cosh —1 

sin’ (x) = cos x lim = tsinx lim ———— Equation (1) 
hoo hoo 

(since cos x and sin x are independent of h), The evaluation of the limit of 


anh at 0 can be demonstrated on the diagram: 
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The are PQ subtends an angle h radians at the centre O of the circle, which 
has unit radius. PR is a tangent and OQR isa straight line. In this diagram 
there are three areas that can be simply expressed in terms of the angle 
hand its trigonometric functions: 


triangle OPR = } x base x height = 40P x PR 
SOP x OP tanh = $tanh 


h 
shaded sector OPQ = x » area of circle 


h 2 
= 55 * ROP? = Sh 


triangle OPQ = 4 base = height 
= 40P x OO sinh = 


sinh. 


Geometrical intuition tells us that {iro <h< 3) 


triangle OPQ < sector OPQ, i.e. sinh < th 
and 

sector OPQ < triangle OPR, i.e. th < }tanh, 
The fact that we are relying on geometrical intuition here means that the 
result of this appendix is demonstrated, not proven, after all, the definition 
of sin that we are using is itself geometrical, The whole argument, in- 


cluding the definition of the trigonometric functions, can be made rigorous 
without any reliance on geometrical intuition. 


Since we are assuming h > 0, we may multiply both sides of the inequality 
$sinh < $h by ‘ and since we are assuming h < 5: which (with h > 0) 
implies cosh > 0, we may multiply both sides of the inequality 
th < Stanh by 2 Combining the two resulting inequalities we 


obtain: 


O<h< 


ist 


(For the rules for the manipulation of inequalities see Unit 6, Inequalities.) 
inh 

Since both cosh and = stay the same when the sign of h is changed, 

these inequalities imply 


coon ag (-E<n<o}. 
h 2 


We know from the geometrical definition of cosh that we can make 
cos has close to | as we please by making / small enough, i.e. that 


lim cosh = 1. 
co) 


inh 
When |h] is very small, therefore, the value of is sandwiched in between 


Peniea sinh 
1 and a number whose limit is 1. It follows that = must also have the 


limit 1 near 0; 
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Equation (2) 


To evaluate the other limit in Equation (1) we reduce it to the preceding 


one, obtaining 


since lim and lim 
hwo a/2h=0. 
Equations (2) and (3) in Equation (1), we have 


sin'(x)=cosx (xe R). 


el 


12.5.3 Appendix III: Evaluation of lim i 
h~0 


are the same, by the definition of a limit. From 
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12.5.3 


(See RBIO) 


Equation (3) 


12.5.3 


Appendix 111 


We use the definition of exp given in Section 7.4.1 of Unit 7, Sequences and 


Limits I, obtaining 


Tee 
mag h 
k hon-1 hn 
i Sage ti 
eran h 


by the binomial theorem. If h > 0, the numerator satisfies 
h? hn 

h < numerator <h + > +05 + 

2 n 
Sh+h pe the 


1—h" h 4 
= Sirey. provided h < 1. 


On using these inequalities in Equation (1), we find that 


PP ee: 
sea when 0<h <1. 


1 
a h = tA 
If h < 0,a similar method shows that 
1 jh 
21 T=iif 
= 1 and that lim 1 — 
rr) 


Now we know that lim 
nmol —h 


whether h is positive or negative, when it is very small the value of 


must be very close to 1, and we conclude that 


et 


Equation (1) 


(See RB8) 
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